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Abstract: We prove a quenched large deviation principle (LDP) for a simple random walk 
on a supercritical percolation cluster on Z'^, d > 2. We take the point of view of the moving 
particle and first prove a quenched LDP for the distribution of the pair empirical measures of 
the environment Markov chain. Via a contraction principle, this reduces easily to a quenched 
LDP for the distribution of the mean velocity of the random walk and both rate functions ad¬ 
mit explicit (variational) formulas. Our results are based on invoking ergodicity arguments in 
this non-elliptic set up to control the growth of gradient functions (correctors ) which come up 
naturally via convex variational analysis in the context of homogenization of random Hamil¬ 
ton Jacobi Bellman equations along the arguments of Kosygina, Rezakhanlou and Varadhan 
i [KRV06] i. Although enjoying some similarities, our gradient function is structurally dif¬ 
ferent from the classical Kipnis-Varadhan corrector, a well-studied object in the context of 
reversible random motions in random media. 

1. Introduction and main results 


1.1 The model. 

We consider a simple random walk on the infinite cluster of a supercritical bond percolation on Z'^, 
d > 2. Conditional on the event that the origin lies in the infinite open cluster, it is known that a 
law of large numbers and quenched central limit theorem hold, see Sidoravicius and Sznitman ( [SS04] ) 
for d > 4 and Mathieu- Piatnitski ( |MP07] ) and Berger- Biskup ( |BB07| ) for any d > 2. However, 
treatment of such standard questions for this model needs care because of its inherent non-ellipticity- 
a problem which permeates in several forms in the above mentioned literature. 

Questions on quenched large deviations for general random motions in random environments have 
also been studied (see section 2.1 for a detailed review on the existing literature) in a fundamental 
work of Kosygina-Rezakhanlou-Varadhan ( [KRV06] ) for a diffusion with random drift and in related 
work of Rosenbluth ( |R06] ) and Yilmaz ( [Y08| ) for general random walks in random environments. 
Other results of relevance are by Rassoul-Agha, Seppalainen and Yilmaz (see |RSY13] ) on directed, 
undirected and stretched polymers in a random (and possibly unbounded) potential. All these re¬ 
sults, however, require certain moment conditions on the environment {ellipticity) which needs to be 
necessarily dropped when studying a non-elliptic model, for example, the classical simple random 
walk on the supercritical percolation cluster (SRWPC). In this context, it is the goal of the present 
article to study quenched large deviation principles for the distribution of the empirical measures of 
the environment Markov chain of SRWPC {level- 2) and subsequently deduce the particle dynamics 
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of the rescaled location (level -1) of the walk on the cluster. We start with the precise description of 
the model of classical bond percolation on 

We fix d > 2 and denote by the set of nearest neighbor edges of the lattice and by the 
set of edges from the origin to its nearest neighbor. Let n = {0,1}®'* be the space of all percolation 
eonfigurations oj = (oJb)b&d- other words, a;;, = 1 refers to the edge b being present or open, while 
Ldb = 0 implies that it is vacant or closed. Let B be the Borel-a-algebra on Q defined by the product 
topology. We fix the pereolation parameter p G (0,1) and denote by P = Pp ;= (p6i + (1 — p)6oj 
the product measure with marginals P(a;f, = 1) = p = I — P(a;fe = 0). Note that Z*^ acts as a group 
on (n, B, P) via translations. In other words, for each x G Z'^, Tx ■ —)• acts as a shift given by 

(TxOj)b = ojx+b- Note that the product measure P is invariant under this action. 

For each w G fl, let Coo(w) = {x G Z'^; x i — oo} denote the set of points x G Z'^, which finds an 
infinite self-avoiding path using occupied bonds in the configuration co. It is known that there is a 
critical percolation probability pc = Pc(d) which is the inhmum of all p’s such that P(0 G Coo) > 0- In 
this paper we only consider the case p > pc- 

For p > Pc, the set Coo(oj) is P-almost surely non-empty and connected. Let Hq = {0 £ Coo}- For 
p > Pc we define the conditional probability Pq by 


Po(^) =P(yl|fIo) A€B. 


We now define a (discrete time) simple random walk on the supercritical percolation cluster Coo as 
follows. Let the walk start at the origin and at each unit of time, the walk moves to a nearest neighbor 
site chosen uniformly at random from the accessible neighbors. More precisely, for each a; G flo, x G Z^ 
and e G Urf, we set 


7r^(x,e) 


^{a;e=l} ° 

Z)|e'|=l = ° 


G [0,1], 


( 1 . 1 ) 


and define a simple random walk X = {Xn)n>o as a Markov chain taking values in Z'’* with the 
transition probabilities 


P^’‘^iXo = 0 ) = 1 , 

PQ’‘^(Xn+i = x + e\Xn = x) = 7r^(x,e). 


( 1 . 2 ) 


This is a canonical way to “put” the Markov chain on the infinite cluster Coo and henceforth, we refer 
to this Markov chain as the simple random walk on the percolation cluster (SRWPC). 


1.2 Main results: Quenched large deviation principle. 

For each uj G Hq, we consider the process (rx„w)n>o which is a Markov chain taking values in the 
space of environments flo- This is the environment seen from the particle and plays an important role 
in the present context, see section 3.1 for a detailed description. We denote by 


C 


n 


1 

n 


n—1 

n=0 


(1.3) 


the empirical measure of the environment Markov chain and the nearest neighbor steps of the SRWPC 
(Xn)n>o- This is a random element of Ali(no x U^), the space of probability measures on flo x U^, 
which is compact when equipped with the weak topology (note that, Hq C fl is closed and hence 
compact). 

We note that, via the mapping (ui, e) i-A- (oj, TcUj) the space 2Vfi(no x U^) is embedded into Mi(no x 
Hq), and hence, any element jj, G Mi(no x U^) can be thought of as the pair empirical measure of the 
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environment Markov chain. In this terminology, we can define its marginal distributions by 

d(//)i(a;) = 

eGVd 

d{^l) 2 {uJ)= d/i(a;',e) = ^ d/i(r_eW,e). 

e: TeCo'=oj e€Ud 

A relevant subspace of Adi (Ho x U^) is given by 

and Pq- almost surely, 
> 0 if and only if u{e) = 1 for e € 


(1.4) 


Ml = MUno X Urf) = <J /i G Mii^o X Urf): (m)i = (/r)2 < 

e 


(1.5) 


d(/r)i(a;) 

A simple, though important, relation between the space Ad* and the environment process {Tx„uj)n>o is 
made transparent in section 3.1- elements in Ad^ are in one-to-one correspondence to Markov kernels 
on rio which admit invariant probability measures which are absolutely continuous with respect to Pq, 
see Lemma l3.ll 

Finally, we define a relative entropy functional 3 : Adi(no ^ U^) [OjOo] via 

dfi{cj,e) 


j(^) = I ^^0 Eeeu, e) log d(^),(yj( 0 ,e) if^ Ml, 

else. 


oo 


( 1 . 6 ) 


For every continuous, bounded and real valued function / on Hq x U^, we denote by 

J*(/) = sup {{f,n)-3{n)} 
lt€A4i(OoxU(j) 

the Fenchel-Legendre transform of J(-) and by 3**(/r), for any fj. G Adi(llo x U^), the Fenchel-Legendre 
transform of fJ*(-)- 

We are now ready to state the main result of this paper, which proves a large deviation principle 
for the distributions and Pq’'^^ ^ on Adi(Oo x U^) and respectively. Both statements 

hold true for Pq- almost every cu G llo- Ih other words our results concern quenched large deviations 
and share close analogy to the results by Rosenbluth ( |R06] i and Yilmaz f [Y08] i for random works in 
random environments. In the present context, due to zero transition probabilities of the SRWPC, we 
necessarily have to drop their assumption requiring p-th moment of the logarithm of the random walk 
transition probabilities being finite, for p > d. Here is the statement of our main result. 

Theorem 1.1 (Quenched LDP for the pair empirical measures). Let d > 2 and p > Pc{d). Then 
for Pq- almost every oj G Dq, the distributions of Cn under satisfies a large deviation principle in 
the space of probability measures on Adi(Do x U^) equipped with the weak topology. The rate function 
J** is the double Fenchel-Legendre transform of the functional 3. Furthermore, 3** is convex and has 
compact level sets. 

In other words, for every closed set C C A4i(Do x U^), every open set Q C A4i(Do x U^) and Pq- 
almost every w G Dq, 

lim sup — log Pq (£,1 G C) < — inf 3**{/a), (1-7) 


and 


n—>-co 


1 


f-L^C 


limsup — logPg’‘^(£n G ^) > — inf 3**{p). 

n^oo 


( 1 . 8 ) 
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Remark 1 The functional 3 is convex on ^ Ud); but fails to be lower semicontinuous, see 

Lemma [621 Hence, 3** ^ 3. 


Theorem o is an easy corollary to the existence of the limit 

exp E / Xk - Xk-i) 


lim - logHo’^^l exp{n(/,/:„)}} = lim -logLig’^ 

n^oo fi vj i. \ / j j n—>-oo fi ^ 


n—1 


k=0 


for every continuous, bounded function f on Qq x and the symbol (/, fi) denotes, in this context, 
the integral dPo(ti;) XleGUd e)dfj,(uj, e). We formulate it as a theorem. 

Theorem 1.2 (Logarithmic moment generating functions). For d > 2, p > Pc{d) and every continuous 
and bounded function f on FLq x U^, 

^ n—1 

Pq — a.s. 

k=0 


^l^-logHo’‘^| exp f'^f{Tx^uj,Xk - Xfc-i)') I = sup^ {(/,//) - a(^)} 

n CO fi y k=() /^eA4* 


We will first prove Theorem 11.21 and deduce Theorem 11.11 directly. 
Note that via the contraction map ^ : Ali(Ho x U(i) —M'^, 


p i-A 


/ '^edp{u,e), 

JQo g 


we have f,{Cn) = 


_ Xn—^O _ Xn 


Our second main result is the following corollary to Theorem ll.il 


Corollary 1.3 (Quenched LDP for the mean velocity of SRWPC). Let d > 2 and p > Pc{d). Then 
(Fil g satisfies a large deviation principle with a rate function 


the distributions „ 


J{x)= inf 3{p) 


X G 


(1.9) 


Remark 2 Note that Corollary II..31 has been obtained by Kubota ( |K12] ) for the SRWPC (see also the 
results of J.C. Mourrat l [M12| i for similar work in the context of random walks in random potential), 
though with transition probabilities slightly different from ours (Kubota considered a random walk 
on the cluster which picks a neighbor at random and if the corresponding edge is occupied, the walk 
moves to its neighbor. If the edge is vacant, the move is suppressed, i.e., the walk is lazy. Note that 
in our model the random walk takes no pauses, i.e., the random walk is agile). However, his method 
of proof as well as the description of the rate function J is completely different from ours. See section 
2.1 for a comparison of results and proof techniques. 


1.3 Literature review 


In d = 1, Greven and den Hollander ( |GdH94] 1 derived a quenched large deviation principle for the 
mean velocity of a random walk in i.i.d. random environment based on techniques from branching 
processes and obtained a formula for the rate function. Indepedently, using passage times on Z, 
Comets, Gantert and Zeitouni l [GGZOd] l derived the LDP (also in the annealed and functional form) 
for stationary and ergodic environments. For d > 1, Zerner ( [Z98| . see also Sznitman ( [S94j ) for 
Brownian motion in a Poissonian potential) proved a quenched LDP under the assumption that 
— Iog7r(0, e) has finite d-th moment and that there exists arbitrary large regions in the lattice where 
the local drifts ^g e7r(0, e) “points towards the origin” (the nestling property). His method is based on 
proving shape theorems and deriving the LDP, the driving force here being the sub-additive ergodic 
theorem. Invoking the sub-additivity more directly, Varadhan ( [VD.l] ) proved a quenched LDP without 
assuming the nestling assumption, but for uniformly elliptic environments. 
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Kosygina, Rezakhanlou and Varadhan ( [KRVOGj ) derived a novel method for proving quenched LDP 
using the environment seen from the particle in the context of a diffusion with a random drift assuming 
some regularity conditions on the drift. This method goes parallel to quenched homogenization of ran¬ 
dom Hamilton- Jacobi- Bellman equations, see section [2] for a review. Rosenbluth ( [BOG] ) invoked this 
theory to multidimensional random walks in random environments and obtained a rate function given 
by the dual of the effective Hamiltonian which admits a formula. The regularity assumption ( |KBVn6j ) 
on the Hamiltonian under which homogenization takes place (or quenched large deviation principle 
hold) now translates to the assumption that — log 7r(0, e) has finite d + e moment, e > 0, of Rosenbluth 
( |R06| i. Under this moment assumption, Yilmaz extended this to a level- 2 quenched LDP (as in The¬ 
orem fLTi) and subsequently Rassoul-Agha and Seppalainen ( |R.Sllj ) proved a level-3 (process level) 
LDP getting variational formulas for the corresponding rate functions. This method has been further 
exploited for studying quenched LDP and free energy for (directed and non-directed) random walks 
in a random potential V = — logvr, see Rassoul-Agha, Seppalainen and Yilmaz f |RSY13] i and results 
concerning log-gamma polymers of Georgiou, Raggoul-Agha, Seppalainen and Yilmaz ( |GRSY13] ) (see 
also |RSY14| and [GRS14] for related models). All these results, though seeing significant achieve¬ 
ments, work only under the standing assumption V = — logTT € L^(P) for p > d and do not cover 
the case V = oo, pertinent to the case of a random walk on a supercritical percolation cluster we are 
interested in. 

As mentioned before, Kubota f |K12| L based on the method of Zerner f [Z98| i (see also Mourrat 
( |M12j i proved a quenched LDP for the mean velocity of the random walk ^ on a supercritical 
percolation cluster, which is very close to Gorollarv 11.31 He also used sub-addtivity and overcame the 
lack of the moment criterion of Zerner by using classical results about the geometry of the percolation. 
This way he obtained a rate function which is convex and is the given by the Legendre transform of 
the Lyapunov exponents derived by Zerner |Z98| ). However, using the sub-additive ergodic theorem 
one does not get a satisfactory formula for the rate function, nor does this method seem amenable for 
deriving a level 2 quenched LDP as in Theorem 11.11 

1.4 Outline. 

Let us now turn to a sketch of the proof of Theorem 11.21 whose guiding philosophy is based on the 
ideas of |KRV06] . A rough idea is to first work on the space of environments to obtain certain ergodic 
properties to derive the lower bound, which, by variational techniques can be shown to be an upper 
bound as well, provided that a certain class of gradient functions, which show up naturally in the 
variational analysis, have a sub-linear growth at infinity. Controlling this growth (which is crucial for 
the upper bound) requires certain regularity properties (see (|2.7ll i of the random drift (or the moment 
conditions of |R06] and |Y08| i which we necessarily drop for the SRWPC and prove the sub-linearity 
of the gradient functions using ergodicity and geometric properties of percolation. An upshot of the 
aforementioned variational analysis for our case is the boundedness of the gradient functions on the 
cluster (our effective Hamiltonian does not grow), a key information for proving their sub-linear growth 
property. 

We end this section summarizing the organization of the rest of the article. Section 2 reviews the 
approach of |KRVr)6| shortly. Although this is well-known, we underline its main ideas to make our 
proof a bit more transparent and put it into context. Sections 3, 4 and 5 are devoted to the proof of 
Theorem 11.21 Section 3 proves the lower bound using some ergodicity arguments of Markov chains on 
environments, where arbitrary Markov kernels possibly admit zero transition probabilities. Section 
4 introduces a certain class of gradients or correctors (this should not be confused with the classical 
Kipnis- Varadhan corrector, see Remark [3] below), which admit a sub-linear growth on the cluster at 
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infinity, which is the main step for deriving the upper bound in section 5. Theorem 11.11 and Corollary 
II..11 are easily deduced from Theorem 11.21 in section 6. 

Remark 3 In our proof, as mentioned, a class Qoo of gradient functions show up naturally in the 
context of variational analysis (see section 4 and section 5.2). These objects share close similarities 
to the Kipnis- Varadhan corrector which is a central object of interest for reversible random motions 
in random media. Particularly for SRWPC this is crucial for proving a central limit theorem f [SS04] . 
|MP07] . [BB07] 1- the corrector expresses the distance between the random walk and a (harmonic) 
embedding of the cluster in where the random walk becomes a martingale. Finer quantitative 
questions (for example, existence of all moments) are of interest, see related work of Lamacz, Neukamm 
and Otto f |LNQ13] 1 on a similar model of percolation with all bonds parallel to the direction ei being 
declared open. However, our functions in class Qoo are structurally different from the Kipnis- Varadhan 
corrector. Though they share similar properties as gradients, objects in class ^oo, in particular, miss 
the above mentioned harmonicity of the Kipnis- Varadhan corrector. Large deviation (lower) bounds 
are based on a certain tilt which spoils the inherent reversibility of the model, a crucial base of Kipnis- 
Varadhan theory. 


2. Input from quenched homogenization: Main idea of our proof 


As mentioned, we follow [KRV06| and work with a diffusion in random drift. The notation used 
in this section should be treated independently from other parts of this article. Let (H,J-', P) be a 
probability space on which acts as an additive group of translations. Let b : H —be a nice 
vector field and let us consider a random diffusion (Vi)t>o on whose inhnitesimal generator is given 

by 

(^[“^u)(x) = ^An(x) -k {h,^{x),Vu{x)), 

where the drift bi^{x) = b{TxUj) is generated by the action of on w. Let be the corre¬ 

sponding Markovian measure starting at 0 € R'^ at time 0 for each oj £ Q. We would like to have a 
large deviation principle for the distribution £ •) almost surely with respect to P. 

The first step is to “lift” the diffusion {Xt)t>o to the environment process Zot = TXt{^) taking values 
on H with inhnitesimal generator Ah = -|- 6 • V where V = is the gradient given by the 

generators 

(Vju) (Lu) = Inn- - - z = 1,..., a, 

of the translation semigroup A key step is to hnd a probability measure which is invariant 

under A and absolutely continuous with respect to P, i.e., to hnd (j) £ L^(P) with (/>> 0, f (()dP = 1 
such that 4> satishes 

\xdg = V-{h<t>). 


If we fancy that such a function 4> exists, then the measure i;i)dP is ergodic (see Kozlovf [K85j l and 
Papanicolau- Varadhan ( |PV81j B. Hence, by the ergodic theorem, 

lim - [ f{uJ{s))ds = f f{u;)c/){u;)dF P — almost surely, 
t^oo t Jq J 

for any test function / on H. This also translates to an ergodic theorem on 
process g{uj,Xt) = f{Txt^)-- 

lim - [ g{uj,Xs)ds= f /(a;)(/>(a;)dP Pq’‘^ — almost surely. 

t^OO t Jq j 


for the stationary 
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If we write down the martingale problem Xt = Wt + Jq b{io, Xs)ds, where {Wt)t>o is a standard 
Brownian motion, then we have the law of the large numbers 


lim —- = lim 

t^OO t t—)-oo 


b{u},Xs)ds= / b{uj)4>{uj)d¥, 


almost surely with respect to P and . 

Unfortunately, for any arbitrary drift b, it is very difficult to find an invariant probability (/>dP. An 
easier task is to do the “converse”: Start with a given probability density (p and find some b so that 

1 


-AcP = V-{bp), 


( 2 . 1 ) 


i.e., 0dP is an invariant density for the generator iA + 6-V (take for instance, for any p > 0,b = 

This is indeed the task set we forth for getting the large deviation lower bound. Let £ be the class of 
pairs (^,6) so that the probability density p satisfies (j2.1|] . We tilt the original measure to 
for any such [p, h) € £. The cost for such a tilt is given by the relative entropy 


log 


dP^‘ 


= 


2 Jo 


6(a;(s)) - 6(w(s))|| ds 


by the Cameron- Martin- Girsanov formula (here || • || denotes the Euclidean norm in M“). We remark 
that for any such pair [b, p) G T, ijidP is ergodic for the environment process with generator i A -|- 5 • V 
and hence the ergodic theorem on fl, again by stationarity, translates to the ergodic theorem on 
implying 


1 


-E 


b.LJ 


t 

and 

^ . 
t 


|6(a;(s)) — fe(a;(s)) Ipds 


\\b{uj) — b{ui)\\‘^ p{uj)dF P — a.s. 


( 2 . 2 ) 


b{uj)p{u})dF, 


almost surely with respect to P and We fix 0 G M'^.Then the measure tilting argument, combined 
with (|2.2p . leads to the lower bound 


liminf - logP^’^^je^^’^*^} > sup {{0,x) —I{x)] 


where 


= sup 
ib,<i>)ee 

=■■ H{e\ 

1 


dFp[{e;h)--\\b{bo)-b{oj)f 


(2.3) 


I{x) = >f 

(6,</>)££ 2 
E(b0) = cc 


\\b{uj) — b{uj)\\'^p{uj)dF. 


The desired large deviation principle would follow once we prove the corresponding upper bound 

limsup-logP'’’‘^{e<®’^*>} <H{e), 

t^OO t 


(2.4) 
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which contains the heart of the argument. In this context (12.11) has an important consequence: For 
any suitable test function g on 


1 

2 




\^ 9 + (ft, V 5 ) 


= 0 for all g if (6, (f>) G 8, 

/ 0 for some g if (6, (j)) ^ f, 


and hence, by taking constant multiples of the left hand side if (6, (p) ^ 8, 

1 , f . ,7 _ V 1 f 0 \i (h,4>) ^ 8, 

—00 else. 


inf - / dP(?i<j Afif + (6, Vg) \ = 


Hence, we can rewrite 

H{e) = sup sup inf I j dF4>(^{e,b) - ^\\b{uj) - b{uj)f + ^{Ag + {b,Vg)} 

> inf sup j dFpl^^Ag + {b,e+ Vg)+ ^\\e+ Vgf 

= inf ess sup j ^A 5 ((w) + {b{uj),0 + S/g{uj)) + i||(9 + Vc/(a;)|p 1, 

9 a;—P t ^ J 

where the lower bound above is a result of an approximation argument, subsequent min-max theorems 
from convex analysis and Lagrange multiplier optimization for b. From this lower bound we would 
like to get some “function” ^ on H, which is a (sub) -solution to 

-Ag{oj) + {b{u!),9 -|- Vg{uj)) + -\\9 + V 5 (w)|p < H{6) P — almost surely. 

It turns out that although g does not exist as a function, under certain technical conditions (see 
below), its gradient G = Vg exists in as a function in some L^(H) (i.e, has p-th moment) and 
satisfies E(G) = 6 (i.e., has mean 0), V x G = 0 in the sense that S/iGj = VjGi (i.e, satisfies closed 
loop condition) and P- almost surely, 

^V.G + {b,G) + ^\\Gf<H{e), 

in the sense of distributions. Via the closed loop (and p-th moment) condition, one can define the 
function ( a corrector) ^'(a;, •) € IF[q’^(]R'^) as 



^(w,x)= [ {G,dz{s)) (2.5) 

where ^'(0,0;) = 0 P- almost surely and z{s) is any path connecting 0 and x. Now the large deviation 
upper bound (12.41) follows by a maximum principle argument once we show that 'L has a sub-linear 
growth at infinity, i.e., 

Il'I'lloo = o(||x||) as ||x|| —>■ 00 (2.6) 

P- almost surely. This requires substantial technical work and one crucial assumption for this is, 
existence of p > d and q > p so that 

ci||Vu||P-C2 < i||Vnf + (6,Vu) < C3||Vu||''- C4 


(2.7) 
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and the condition p > d allows one to invoke Sobolev imbedding theorem to (locally) control ||'k||oo- 
Then (12.6p implies the upper bound 

inf esssup|^||Gf + (6,G) + iv-Gl <F(0) (2.8) 

G:VxG=0 P L ^ ^ J 

EG=9 

Combined with the lower bound (j2.3p . this shows that, in fact equality holds above and proves the 
existence of the moment generating function (and hence the desired LDP). 

We will end this formal discussion on diffusions with random drift by pointing out that the above 
arguments are in fact equivalent to a quenched homogenization effect: Getting the required function 
G as in ()2.5p is equivalent to getting an upper estimate on the solution of 

dtu = ^Au+^\\Vuf+ {b,Vu) 
n(0, x) = {9, x), 

On the other hand, the Cole-Hopf transform n = e“ solves 

dtv = + (6, Vn) 

n(0, x) = 

which has a Feynman-Kac representation v{t,x) = Hence, studying the large deviation 

asymptotics (recall (|2.3I) and (I2.4p i limj^oo j logn(t, 0) is same as studying lim£^o^logn(l/e,0) = 
lim£_>.o tte(l, 0) where Us solves the rescaled version of (12.9p 

1 1 

dtUe = + -llVuell^ + {b{x/e,uj) ,Vu) 

Ue{0,x) = {6,x). 

The limit satisfies _ 

dtu = H{Vu) 

u(0, x) = {6, x), 

where the effective Hamiltonian H again has the variational formula (j2.8p . 

3. The ergodic theorem and the lower bound. 

In this section we start with the proof of the lower bound asymptotics in Theorem 11.11 and Theorem 
[Ql We need some input from the environment seen from the particle, which, with respect to a suitably 
changed measure, possesses important ergodic properties. 

3.1 Markov chains on environments and the ergodic theorem. 

Recall that, given the transition probabilities vr from for Pq- almost every u> G fig, the process 

(Tx„cv)n>o is a Markov chain with transition kernel 

(R^f)(uj) = '^TT^{0,e)f{TeU;), 

e 

for every / which is measurable and bounded. 

We need to introduce a class of transition kernels on the space of environments. We denote by H 
the space of functions tt : Hq x [0,1] which are measurable in Hq) Z^eGUd = 1 for almost 

every a; G Hq and for any w G Hq and e G U^, 

7r(a;, e) = 0 if and only if 7r,j(0, e) = 0. 


(3.1) 
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For any vr G II and w G Oq, we define the corresponding quenched probability distribution of the 
Markov chain {Xn)n>o by 


= 0 ) = 1 

P^'‘^{Xn+l = x + e\Xn = x) = ^{TxUJ,e). 


(3.2) 


With respect to any ff G II we also have a transitional kernel 

eeUd 


for every measurable and bounded /. For any measurable function (j) > 0 with J (/)dPo = 1, we say 
that the measure 0dPo is i?^-invariant, or simply vr-invariant, if, 


(p{uj) = ^ 7r(r_eW,e)(/)(r_ew). 
eGUd 


Note that in this case, 

J /(w)(/>(a;)(iPo(w) = J{Rjif ){uj)(l){uj)dFo{uj), 

for every bounded and measurable /. 

We denote by £ such pairs of (tt, cj)), i.e., 


(3.3) 


£” = I (tt, (/)): TT G n, (/> > 0, Eo((/>) = 1, (/>dPo is vr — invariant j. (3.4) 

We need an elementary lemma which we will be using frequently. Recall the set from (II.5p . 

Lemma 3.1. There is a one-to-one correspondence between the sets Xii and £. 


Proof. Given any (vr, 4>) G £, we take 

d/r(w, e) = 7r(w, e)4>{uj) dPo 

= 7r(a;,e) Tr{u',e)(p{uj')dFQ. 


(3.5) 


e:TeU)'=LO 


By (II.4p . Po-almost surely, 


d(/i)i(w) = dn{u},e) 

eGUd 

= ,e)(f>{uj') dFo 

e:Te0j'=0J 

= Y b/r(w',e) 

e:reu'=LJ 

= d(^)2(a;). 

Hence, (/u)i = {fi )2 <S Pq- Furthermore, for any edge e G being open in the configuration io (i.e., 
w(e) = 1), 

dfi{uj, e) 


d{p,i){u) 


= Tr{oj, e) > 0, 


recall m- Hence p, G Xi^. 
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Similarly, given any // G we can choose 




d(/i)i \ 

d(^)i’ dPo J 


€ £. 


□ 


We also need an ergodic theorem for the environment Markov chain under any transition kernel 
TT G n. This result is standard in the elliptic case. The proof below is an adaptation of the standard 
proof to our non-elliptic setting. 

Theorem 3.2. Fix vf G IT. If there exists a probability measure Q -C Pq which is ir-invariant, then 
Q ~ Pq and the environment Markov chain with initial law Q and transition kernel vf is stationary and 
ergodic for Pq. Moreover, there is at most one probability measure Q which is vf- invariant probability 
and is absolutely continuous with respect to Pq. 


Proof. We hx TT G n and let Q Pq be tt- invariant. We prove the theorem in three steps. 

Let us first show that, > 0 Pq- almost surely. This will imply that Q ~ Pq. 

Indeed, to the contrary, let us assume that, 0 < Po(^) < 1 where A = > O}. Then, 

Q ~ Po(-|j 4 ). If we sample ui G Hq according to Q and U 2 according to 7 f(a;o, •)) then the distribution of 
UJ 2 is absolutely continuous with respect to Q (recall Q is fr invariant) and thus, on the distribution 
of W 2 has zero measure. 

This implies that, for almost every oji ^ A and every e G such that 7f(a;i,e) > 0, TeWi G A. 
Since tt G 11, for almost every wi G A and every e G such that 7r(a;i,e) > 0, TgUJi G A. Now if 
we sample ui according to Po(-|A) and ijJ 2 according to 7r(a;i,-), then, with probability 1, U 2 G A. 
In other words, A is invariant under vr (more precisely, A is invariant under the Markov kernel Rtt). 
Since Pq is 7r-ergodic (see Proposition 3.5 in [BB07| i. Po(^) G {0,1}. By our assumption, Po(^) = 1. 

Now we prove that the environment Markov chain with initial law Q and transition kernel tt is Pq 
ergodic. Let us assume on the contrary, that for some measurable D, Q{D) > 0, Q(T>‘^) > 0 and D 
is if invariant. Hence Po(T)) > 0 and Po(D'^) > 0, by the last step. Further, the conditional measure 
Qd(') = Q(‘|F>) is TT invariant and Qd <C Pq- But = 0 and hence, = 0. This 

contradicts the first step. 

To conclude the proof, we need to prove uniqueness of any Q which is if- invariant and absolutely 
continuous with resect to Pq. Let be the space of the trajectories (...,a;_i, wq, wi,...) of the 
environment chain, /tq the measure associated to the transition kernel if whose finite dimensional 
distributions are given by 


s((' 


U-r 



n—1 

Q(da;_„) if(w^, dw^+i). 

j=-n 


for any finite dimensional cylinder set A in Let T : —)• be the shift given by {Tuj)n = uJn+i 

for all n (z Z. Since Q is if- invariant and ergodic, by Birkhoff’s theorem, 


lim 

n—>-oo 


n—1 « 

k=0 


IXq (and hence ^pp) almost surely for any bounded and measurable g on Since the environment 
chain {TXf.w)k>o has the same law in f P^’^^dQ as (cuq, wi,...) has in yiQ, if /(wq) = g(uJo,uJi, ■ ■ ■), 
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then 


n —1 n —1 „ „ 

lim y] / o TXfc = lim y 5 o = / 

1^00 ' n—>-oo J J 


= / /dQ, 


fc=0 fc=o 

for any bounded and measurable / on The uniqueness of Q follows. 

Corollary 3.3. For any pair (vf , (p) € £ and every continuous and bounded function / : x 

1 f 

lim - y /(rxfco;, - X^) = / dPo y /(<^, e)7f (w, e) Pq - a/mosf surely. 

n^oo n ^ /o„ 

k=0 e 

Proof. This is an immediate consequence of Theorem 18.21 and Birkhoff’s ergodic theorem. 

3.2 The lower bounds of Theorem 11.11 and Theorem 11.21 

We are ready to deduce the lower bound (jl.Sp . Recall the definition of 3 from (jl.bl) . 
Lemma 3.4. For any open set Q in A4i(no x U^), Pq- almost surely, 

liminf — logPP^^fTn & G) > — inf J(//) 

n-j-oo n 


□ 


□ 


= -infa**(/r). 


(3.6) 


Proof. For the lower bound in (|3.6p . it is enough to show that, for any g E M.\ and any open 
neighborhood U containing g, 

liminf — log ^U) > —3(g,) 

n^oo n ^ ^ ' 

Given /i E Xii, from Lemma l3.II we can get the pair 

( dh d{n)i\ 

and by Theorem 13.21 

lim P^’^iCn^U) = 1. 

Further, 

dPPf- 


(3.7) 

(3.8) 

(3.9) 




' dPo^’'^ 


Hence, by Jensen’s inequality. 


= / dPo’^’‘"<j Il|£^gi^}exp<J - log 


dPo^’^^ 


lim inf — log PP^ iCn € > hm inf — log PP^ iCn ^U) 

n—>-00 n ^ ^ ' n—>-00 n ^ ^ ' 

1 


— lim sup 


n-foo nPQ’‘^{Cn E U) J{C„eU} 

= - [ dPo(a;) ^ if (w, e) log 

J vr^We 

= -^(/^), 


dPn"’" log 


dP^’‘^ 

dPP^ 
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where the first equality follows from (13.91) and corollary 13.31 and the second equality follows from (j3.8|) . 
This proves ()3.7I) . Finally, since G is open, inf^ggfJ(/r) = inf^gg fJ**(/x) (see [R7n| i. This proves the 
equality in (j3.6jl and the lemma. □ 

Corollary 3.5. For every continuous and bounded function f : Qq x \]d —M and for ¥q- almost 
every ui & Qo, 

liminf-log£(o’'^|exp r^/(rXfca;,Xfc+i> sup {{f,/a) - 3{n)} 

^ ^ ^k=o (3.10) 

= sup {(/,Ai) - J(^)}. 

At€A4i(OoxUji) 

Proof. This follows immediately from Varadhan’s lemma and Lemma 13.41 □ 


4. A CLASS Goo OF GRADIENTS. 


We introduce a class of functions which will play an important role for the large deviation analysis to 
follow. We say that a function G : GIq x ^ M is in class Goo if it satishes the following conditions: 


• Zero mean: For every e € U^, 

Eo(G(-,e)) =0. (4.1) 

• Uniform boundedness. For every e G U^, 

ess sup G(uj,e) = M < oo. (4.2) 

CJ—Po 

• Closed loop. Let (xq, ..., x„) be a closed loop on the infinite cluster Coo (i-e., xq, xi,..., x^ 
is a nearest neighbor occupied path so that xq = x^). Then, 

n—1 

G{TxjUJ, Xj+i — Xj) = 0 Pq — almost surely. (4.3) 

j=0 

For any G € Goo, the closed loop condition has two important consequences in the present context: 
First, along any nearest neighbor occupied path (xq, xi,..., Xn) so that xq = 0 and x^ = x on Coo, we 
can define the function 

n—1 

Tg(w,x) = '^{uj,x) = '^G{TxjUJ,Xj+i - Xj) (4.4) 

j=0 

by the closed loop condition, this definition is independent of the chosen path for almost every ui G 
{x G Coo}. 

Secondly, again by the closed loop condition, T satishes 

• Shift covariance: For Po-almost every w G Hq and all x,y G Coo, 

T(w,x) - T(w,?/) = T(ryW,x - y). 

For any given G G Goo, let us hx which satishes an important property. 


Theorem 4.1 (Sub-linear growth at inhnity on the cluster). For any G G Goo, '1' ho,s at most sub- 
linear growth at infinity on the infinite cluster ¥q- almost surely,. In other words, 


^ |'I'(w,x)| 

hm max - - = 0. 

n—>-00 x€Coo 

|a:|<n 


n 
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Before we present the proof let us collect some useful facts which will finish the proof of the theorem. 
First, we start with a weaker version of the above result. 

Lemma 4.2 (Sub-linearity on average). For every e > 0, 

lim —7 If,.. , 1=0 Pn — almost surely. (4.5) 

n-,oon<^ 1 |4'(a:,a;|>£nl ^ ^ ^ ^ 

\x\<n 


Proof. This follows verbatim the proof of Theorem 5.4 in [BB07] (Interestingly, in [BB07] . along with 
the mean zero and shift-covariance, 'll being only square integrable with respect to Pq is enough to 
deduce the above result). □ 

We also need the following version of the classical result of Antal-Pisztora l [AP96] i about the 
chemical distance of two points in the cluster. Indeed, for p > Pc{d) and x,y € Coo, let dchix, y) denote 
the minimal length of an open path connecting x and y. 

Lemma 4.3. Fix <5 > 0. Then there exists a constant c = c{p, d) such that, Pq- almost surely, for every 
n large enough and points x,y & Coo with \x\ < n, \y\ < n and \x — y\ < 5n, we have dg^(x,y) < c5n. 

Proof. The statement of this lemma, which is slightly stronger than what is stated in Antal-Pisztora 
f [AP96] i follows from Lemma 2.14 in [DGKOlj . □ 

We need another elementary fact. Let 9{p) denote the percolation density, i.e., 9{p) is the probability 
that 0 is in the infinite open cluster. 

Lemma 4.4. Fix d > 0. For every n large enough, in a ball of radius dn in [—n, n]'^ there are at least 
5^{2nY points in Coo- 

Proof. This is an easy consequence of Birkhoff’s ergodic theorem. □ 

We are now ready to prove Theorem 14.11 

Proof of Theorem \4.1\ Let e > 0 be arbitrary, 5=5 ^ n large enough so that the following 

three implications hold: 


l/iT/ 1 \ < 

cc|<n 


(4.6) 

• For any x,y € Coo with x| < n, \y\ < n and \x — y\ < 5n, 



d(.j^(x, y) < cdn 


(4.7) 

• 

^{points in a box of radius 5n in [—n, in Coo} > 2en'^. 


(4.8) 

These are consequences of Lemma 14.21 Lemma 14.31 and Lemma 14.41 respectively. We note that, for 
such small e > 0 and large n and every x G [— n, n]*^, there exists y G [—n,n]'^nCoo so that \y — x\ < 5n 
and |'I'(a;,x)| < en, Po-almost surely. Indeed, by (14. 6h there are at most en'^ points z G [— n,n]'^ such 
that 'k(a;, 2 ;) > en and by (j4.7[). there are at least 2en'^ points in Bns{x) nCoo- Hence, we have at 


least one point y G [—n, n]'^ fl Coo such that \y — x\ < Sn and |'I'(a;, y)\ < en, Pq- almost surely. 
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Recall the definition of \1' from (|4.4|) . Then, by (I4.7|) . 

- T(a;,y)| < d^Yii^,y)ess sup G{u:,x) 

u}—Po 

< cSnM, 

for some M < oo, recall (14.21) . Hence, Pq- almost surely, 

|'h(a;, x)| < |'h(a;, y)| + c5nM 
< en + c5nM. 

Since e > 0 is arbitrary. Theorem 14.11 is proved. □ 

We end with a corollary to Theorem 14.11 


Corollary 4.5. Let G G l/oo- Pbr every e > 0, there exists = Ce(a;) so that, for every sequence of 
points {xk)^=Q on Coo with xq = 0 and \xk+i — x^l = 1, 


n—1 

G{Txt^UJ,Xk+l - Xk) 

k=0 


< Cs + ne. 


In particular, 


n—1 

G{tx^u}, Xk+i - Xk) > -Ce - ne. 

k=0 


(4.9) 


5. Limiting Logrithmic moment generating functions: proof of Theorem 11.21 


In view of Corollarv l3.5l Theorem ll.2l will be proved as soon as prove an upper bound of the limiting 
logarithmic moment generating function which matches the right hand side of (|3.10p . We first prove 
an upper bound based on the sub-linear growth property of gradient functions from the last section 
and subsequently show that this upper bound matches the lower bound in (j3.inp . 


5.1 The upper bound. 

Lemma 5.1. For Pq- almost every uj G CIq, 

1 f 

lim sup — log Eo,a; 1 exp (E f{TX fc^) 

n^oo n I 


Xk+i-Xk)}^ 


< inf A(/,G), 


where 

A(/, G) = ess sup log E'm.) =i}7r^(0,e)exp {/(w,e) + G{u,e)], 


Proof. Fix G G Goo- By the dehnition of the Markov chain Pq’‘^ we have, 

exp |/(rXfcW, Wfc+i - Xk) + Ginx^^^oj, Xk+i - Xk) 

= 7T4Xk,Xk + e)efGx,u.'^)+G{rx,u:,e) 

|e|=l 



|e|=l 

where the uniform upper bound follows from (15.Ih . 


(5.1) 
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Invoking the Markov property and successive conditioning, we have 


El 


wl'^(f{Tx^,uj,Xk+i-Xk) + G{TXf,uj,Xk+i-Xk)]\\ (5.2) 

k=o ^ ^ ^ 

Plugging the lower bound (|4.9p in (15.21) . dividing by n on both sides, taking logarithm and passing to 
lim sup„_^oo ''^6 have the upper bound 

^ f n-l . 

lim sup — log Eo,ij I exp IE f{Tx,io,Xk+i-Xk)} \<A{f,G)+s. 

n-^oo I J 


Passing to e —>■ 0 and subsequently taking infcegoo finish the proof of the lemma. 


□ 


5.2 Equivalence of bounds: Variational analysis. 

We pick up from the lower bound (j3.10l) and denote this variational formula by 
H{f) = sup {(/,//) -3'(//)} 

= sup I /'dPo(^^)</>(w) V 7r(a;,e)|/(a;,e) - log 
(*,<A)G5 U 1^^ I 

recall from Lemma [Q the one-to-one correspondence between elements of the set M\ and the pairs 
8 (and (13.8p . (jl.filP . In this section we show that H{f) equals the upper bound obtained in the last 
subsection. Modulo some care about containment in the infinite cluster, the line of arguments follow 
parallel to |KRVr)6| (and also |R,06] . [YDS] ). 

Proposition 5.2. For every e > 0, there is some G Goo so that 

Aif,Ge)<H{f) + e, (5.4) 

where A(/, G^) is defined in (15.111 . Hence, by Corollarv \3.5\ and Lemma FOl 

:H^(/)= inf A(/,G). 


7r(a;, s) 1 1 
7ra;(0,e) / /’ 


Proof. We split the proof into several steps. 

Step 1: Convex analysis. 

We recall the definition of pairs 8 from (13.311 and note that H{f) can be rewritten as 


sup sup inf 
<t> sen ® - 

since, by (13.311 . 


eeUd 


dPo(w)(^(a;)<l ^ 7r(u;,e)( f{uj,e) - log ) - [gireu:) - g{uj)} 


inf / dPo</’(‘^) E gijeU}) - g{uj) 

^ eeUd 


0 if (</), if) € 
—oo else. 


with the infimum over g being taken over all bounded B- measurable real functions. 


(5.5) 


In order to proceed with the variational analysis, we would like to swap the order of sup-g and 
infg in (j5.5p by invoking a min-max theorem which requires a compactness argument. We choose a 
sequence of finite cr-algebras {Bk)k>i so that, for each k, B^ contains information about all open (and 
hence closed) bonds in the box of size k around the origin. Then, for each k>l, vr. (0, e) is measurable 
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with respect to Bk, for each e € U^. Furthermore, C for all e € and B = cj(Ufc>i0fc), 

where B is the Borel a- algebra. 

For each > 1 we can restrict both the supremums in (15.5p to i3fc-measurabale probability densities 
(j) and Bfc-measurable it G FI and get a further lower bound 


H (/) > sup sup inf 


fen 


dPo(a;) 4>{uj) 


^ TT{u,e)(f{uj,e) - log 
eeUd ^ 


7r(w, e) 
Ta;(0,e) 




Since each B^ is finite, the supremum over tt is taken over a compact set. Further, since the integral 
above is concave and continuous in it and linear (in particular, convex) in g, a min-max argument 
([ESS]) allows us to change the order of sup-^j^ and inf^, leading to 


H (/) > sup inf sup 
<l> ^ ffen L 


dPo(a;) 4>{uj) 

^ TT{u,e)i^f{uj,e) - log 


7t(w, e) 

Ta;(0,e) 




(5.6) 


We can take conditional expectation of the integrand above with respect to Bk and use that both (p 
and it are Bfc-measurable to rewrite the right hand side above as 


sup inf sup 
<t> ^ ffeff 



log it(a;, e 



(5.7) 


where 

h{uj,e) =Eo{fog7ra;(0,e) + /(w,e)+ 5 ((w) - g{TeUj)\Bk] 

= log7r^(0,e) +Eo{/(a;,e) + g{uj) - g{TeUj)\Bk}, 

and we used that by our choice, vr is also Bk- measurable. Staring at (|5.7I) we note the local dependence 
of the integrand on it allowing us to bring the supremum over it inside the integral, leading to 


H{f) > sup inf 

(p 9 


dPo(a;) <p{uj) sup 

ttGII 


^{uj,e){h{uj,e) 

e£Vd 


log it (w. 



A direct Lagrange multiplier computation shows that the supremum in the integrand above is 
attained at 

Qh{u),e) 

it(w,e) = -TT-r 

_ 7r^(0,e) exp {Eo(/(u;,e) + g{uj) - g{TeUj)\Bk)} 

E|e|=i^‘^(0>e) exp{Eo(/(a;,e) +5f(a;) - g{TeUj)\Bk)} 

Replacing this value in the integrand leads to the lower bound 


G n(a;, e). 


F7(/) > sup inf 

(j) 9 


J dPo(a;)(/)(a;) 

eeUd 


(5.8) 
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Again using a similar min-max argument to change to the order of supremum and the infimum and 
subsequently replacing the supremum over (p with the integral f dPoi;/> by ess sup^_pj, we arrive at the 
lower bound 


J^(f) > inf ess sup 

^ CJ— Po 



Note that, we can restrict the above lower bound to 


ff(f) > inf ess sup 

5 a;—Pn 


log ^ ]l{<^(e)=l}7ra;(0,e)^ 


Eo-lf(uj,e)+g(uj)-g(TeUj)ll3kj | 


(5.9) 


Step 2: Approximate gradient and uniform boundedness. 

This implies, that, for every e > 0 and k > 1, there exists a bounded jB measurable function so 
that, for Pq- almost every a; € flo) and e € U^, 

^{oj(e)=l}^o{9k,e(i^) - ffk,e(TeUj)i^k} < -^{uj(e)=l}^OgTruj{0, e) + \\f\\oc, + H{f) + e. (5.10) 


We set, 


Gk,s{(^,e) = l{a;(e)=i}IEo{fffc,£(a;) - gk,s{re^)\t3k-i}- (5.11) 

We would like to show that, for every e > 0, the family {Gfc^e(-, e)}fc>i is uniformly bounded in the 
essential supremum norm. 

First, by taking conditional expectation on both sides of (j5.inj) with respect to Bk-i we have 


Gk,e{^,e) < -Eo{]l{a;(e)=l} log TT^ (0, e) } + ||/||oo + T{f)+e 

= -]l{a;(e)=i}logvr^(0,e) + \\f\\oc + Hif) + e (5.12) 

< log(2(i) + ll/lloo + H{f) + 


since the random walk transition probabilities are bounded away from zero on the event {w(e) = 1}. 
We can also reverse the argument to get a lower bound: Indeed, if H{uj,e) = Hk^£{uj,e) = gk,e{^) — 
gk,e{Te^)-, then H{u, —e) = —H{T-eUJ,e). Now applying (j5.10p again for the edge —e, we have 

r(/) + ll/lloo - ]l{^(-e)=i}log7r^(0,-e) + e > Eo{H{uj,-e)\Bk}l{oj^-e)=i} 

= Eo{ - ^{T-,uj){e)=l}H{T-e(^,e)\Bk} (5.13) 

=Eo{ - ^L,ie)=i}Hiuj,e)\TeBk}, 

where the second equality follows from the symmetry of the conductances. Again taking conditional 
expectation on both sides with respect to Bk-i and recalling that Bk-i C TeBk, we have 

-Gk,e{^,e) < -Eo{l{^(_e)=i} log7r^(0,-e)|;Bfc-i} + \\f\\oo + H{f) + e 

= -ll{(^(-e)=i}log7r^(0,-e) + \\f\\oo + H{f) + e (5.14) 

< log(2d) + ll/lloo + H{f) + £, 


since the transition probabilities are again bounded away from zero on the event {a;(—e) = 1}. We 
combine (j5.12l) and ()5.14p to conclude that, for some non-random constant M < oo, 

ess sup Gk,e{oj,e) < M \/k > 1. (5.15) 

a;-Po 


Step 3: Convergence to the gradient G, € ^oo. 
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Since is a uniformly bounded family, it is weakly compact and weakly converges, 

possibly along some subsequence, to some Ge(-,e), which clearly has Eq expectation zero and is also 
uniformly bounded in the essential supremum norm. 

Let (xo ,..., Xn) he a closed loop on Coo (i-e., xq, xi,..., x„ is a nearest neighbor occupied path so 
that xo = Xn)- Note that, weak convergence preserves conditional expectation. Then, for any fixed 
I > 1 , 


Eos '^GeiTxjUJ,Xj+l -Xj) Bi \ 
^ j=0 ^ 


n-1 


Eo< lim Gk,£iTx.uj,Xj+i 

I ' fe—>-oo 


j=0 


— Xi 


Bi 


n—1 


j=0 J J \ / 

j =0 ^ ^ 

y^^li^Eo|Eo^ 5 fc,£(Tx,.w) - gk,e{'rxj+-,^)\T-xjBk-^ 


Bi 


For k large enough, Bi C T-x^Bk-i and hence, by the tower property the last term equals 

A 


^3 
n-1 


lim Eo( gk,e{Txi(^) - gk,eirxi+i‘-^) 
^^ fc —>-00 V 


j=0 


lim y^Eol - gk,ei'rxj+i^^) Bi j 

3=0 ^ ^ 


= 0 . 


This implies that, 

n—1 

G^{tx^oj, Xj_|_i — Xj) = 0 Pq “ almost surely. 

3=0 

We conclude that € Qoo- 


Step 4: Conclusion. 

Recall the lower bound (|5.9p . Since Eo(/(w,e)|0fc_i) is a bounded martingale, it converges almost 
surely to f{-,e). Hence, 

lEo(l{a;(e)=l}log7r^(0,e) +/(w, c) +Gk,£{u3,e) 

converges weakly in LP{Fq) for any p > 1 to ]l{^(e)=i} log7rt^(0, e) + /(a;,e) + Ge(-,e). By Mazur’s 
theorem, there exists a finite convex combination of the above terms which converges strongly 
to l{^(e)=i} log7rt^;(0, e) + f{u},e) + Ge{-,e) in L^’(Po) and hence, along a further subsequence to 
l{(..;(e)=i} log7r^(0,e) + /(a;,e) + G^{-,e), Pq- almost surely. 
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We take conditional expectation on both sides of (IS21) with respect to and use Jensen’s 

inequality to get 

log ^ exp{]l{^(e)=i}log7r^(0,e) + Eo(/(w,e)|;Bfc_i) +Gk,s{'^,e)} < H{f) + e. 
e&id 

Again, applying Jensen’s inequality to the aforementioned convex combination, and subsequently 
taking k ^ oo, we get 

log ^ ]l{^(e)=i} log7r^(0,e) exp {/(a;,e)+ 0 ^( 0 ;, e)} <H{f) + £. 

eGUd 

This finishes the proof of Proposition 15.21 

□ 


6. Large deviation bounds 

We start with a lemma which proves an important property of of the functional 3 defined in (ll.6p . 
Lemma 6.1. 3 and 3** are convex in Ali(no ^ Urf). 

Proof. Fix X € (0,1) and /U, u G AJi(no x Urf)- It is enough to show that if /r, u € then 

3{x^ + (1 ~ x)v) < x3{fi) + (1 — x)3{v). 

For /r, u G we define, 

<''‘’^'‘>-(< 1 (^) 1 ’ dP„ )• 
du d(u)i 


(ill/) (j^u) — 


d(u)i’ dPo 


Then, by Lemma[3Tl (7f^,(^^), G £ and 

= / dPo(/>^(w)^7f^(a;,e) log 

e 

3{u)= d¥o4>^{u})'^TT^{uj,e) log . 

J , vr^(0,e) 


7 r^( 0 ,e) ’ 
7fj,(a;,e) 


Denote by A = x/x + (1 — x)u. Then (tt, (f) = 11 ^) where 

(j) = X(f)fj. + {1 - x)(f^, 7f = i/vf^ + (1 - y)7fi., 
with y{-) = Then, 

J(A) = 3{xy + (1 — x)u) 


dPo(/>(a;) ^ fr(w, e) log 

'^ui[o,e) 


< x| y d¥o(f)f,{uj)'^TTij_{u},e) log^lt^l|y| + (1-x)| y dPo0i.(w)^7f,,(a;,e) log 
= x3{fj.) + (1 - x)J(u), 

where we used convexity of the function / i-a / log / and Jensen’s inequality for the upper bound. □ 


vi'i.(w,e) 
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We end with the proof of Theorem 11.11 and Corollary 11.31 
Proof of Theorem \1.1\ By Theorem 11.21 

lim -log.Eo’‘^{exp{n(/,£„)}} = sup {{f, fi) - 3{n)} = sup {(/,/i) - J(//)} = a*(/i). 

fieMiifioxUd) 

Since Oq is a closed subset of hi = {0,1}®'* and hence, is compact, Ali(no x U^) is compact in the 
weak topology. The upper bound (11.711 for all closed sets now follows from Theorem 4.5.3 [DZ98| . The 
lower bound (jl.8p has been proved by Lemma 13.41 □ 

Proof of Corollarv M.S]: The claim follows by contraction principle once we show that inf^(^)= 3 , = 

infg(^)= 3 , This is easy to check using convexity of 3 and J**. □ 

We finally prove that 3 is not lower semicontinuous. 

Lemma 6.2. Let d >2 and p > Pc{d)- Then 3 is not lower-semicontinuous on A4i(no x U^). Thence, 
3 / 3**. 


Proof. For any /3 > 1, we define 


where 


e) = 


'tp(e) = 




Se'GUd ’*/'(®0^{ij(e')=l} 

f /3 > 1 if e = ei. 


en. 


else. 


Let Xn ^ be the Markov chain with transition probabilities By |BGP03] and |S03| . there exists 
du = du{p, d) > 0 so that for /3 > Pu, the limiting speed 


vW 

T 

iim -, 

n^oo Ti 


which exists and is an almost sure constant (see [BGP03j and |S03j L is zero. Then, by Kesten’s lemma 
(see [K75]), there exists no ^ € L^(Po) so that f>) € £. We split the proof into two cases. 

Suppose there exists a neighborhood u of so that every if G u fails to have an invariant density. 
Then, for any G u and any probability density f G L^(Po)) let /i/j be the corresponding element 
in A4i(flo X Urf) (i.e., d//^(a;,e) = 7r^^^(u;,e)(/>(ti;)dPo(w)). Since 

by Lemma [3Tl ^ M.\. Then, 3{pLp) = oo by (11.61) . If 3 were lower semicontinuous on A4i(no x U^), 

then 3 = 3** and by Theorem ll.il 

Po^’‘^{£n € n} (6.1) 

would decay super-exponentially for Pq- almost every co € flo, with n being some neighborhood of 
p,^. However, since for every ca, the relative entropy of ■) w.r.t. 7r^(0, •) is bounded below and 

above, the probability in (16.11) decays exponentially and we have a contradiction. 

Assume that there exists no such neighborhood u of Let 7f„ —)■ such that for all n G N, vf^ 
has an invariant density fn and {Trn,4>n) G £. If (pn)n is the sequence corresponding to {Ttn,4>n), since 
A4i(Ho X Urf) is compact, pn ^ Pp weakly along a subsequence. However, by our choice of /3 > /3u, 
(f) ^ £ for any density f and hence pp ^ and 3{pjs) = oo. But, 


lim 3{pn) = [ dPo(()(a;) ^ 7r^'^'(a;, e) log 

n—)-oo / ' 

eeVd 


{uj, e) 

Taj(0,e) 


which is clearly finite. This proves that 3 is not lower semicontinuous. 


□ 
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